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Abstract — The sonar images provide a rapid view of the 
seabed in order to characterize it. However, in such as uncertain 
environment, real seabed is unknown and the only information 
we can obtain, is the interpretation of different human experts, 
sometimes in conflict. In this paper, we propose to manage this 
conflict in order to provide a robust reality for the learning 
step of classification algorithms. The classification is conducted 
by a multilayer perceptron, taking into account the uncertainty 
of the reality in the learning stage. The results of this seabed 
characterization are presented on real sonar images. 

I. Introduction 

The seabed characterization serves many useful purposes, 
e.g help the navigation of Autonomous Underwater Vehicles 
or provide data to sedimentologists. In such sonar applica- 
tions, seabed images are obtained with many imperfections 
.[1]. Indeed, in order to build images, a huge number of 
physical data (geometry of the device, coordinates of the ship, 
movements of the sonar, etc.) are taken into account, but these 
data are polluted with a large amount of noises caused by 
instrumentation. In addition, there are some interferences due 
to the signal traveling on multiple paths (reflection on the 
bottom or surface), due to speckle, and due to fauna and flora. 
Therefore, sonar images have a lot of imperfections such as 
imprecision and uncertainty; thus sediment classification on 
sonar images is a difficult problem even for human experts. In 
this kind of applications, the reality is unknown and different 
experts can propose different classifications of the image. 
Figure Q] exhibits the differences between the interpretation 
and the certainty of two sonar experts trying to differentiate the 
type of sediment (rock, cobbles, sand, ripple, silt) or shadow 
when the information is invisible. Each color corresponds to a 
kind of sediment and the associated certainty of the expert for 
this sediment expressed in term of sure, moderately sure and 
not sure. Thus, in order to learn an automatic classification 
algorithm, we must take into account this difference and the 
uncertainty of each expert. For example, how a tile of rock 
labeled as not sure must be taken into account in the learning 
step of the classifier and how to take into account this tile if 
another expert says that it is sand? 

Textured image classification, such as sonar image, is gen- 
erally done on a local part of the image (pixel, or most of 
the time on small tiles of e.g. 16x16 or 32x32 pixels). Usual 



sonar image classification methods are usually supervised [2], 
[3], [1] and can be described into three steps. First, significant 
features are extracted from these tiles. Generally, a second 
step in necessary in order to reduce these features, because 
they are too numerous. In the third step, these features feed 
classification algorithms. The particularity in considering small 
tiles in image classification is that sometimes, two or more 
classes can co-exist on a tile. How to take into account the 
tiles with more than one sediment? 
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Fig. 1, Segmentation given by two experts. 

Many fusion theories can be used for the experts fusion in 
image classification such as voting rules [4], [5], possibility 
theory [6], [7], belief function theory [8], [9], [10], [1 1]. In our 
case, experts can express their certitude on their perception. 
As a result, probabilities theories such as the Bayesian theory 
or the belief function theory are more adapted. Indeed, the 
possibility theory is more adapted to modelize the imprecise 
data whereas probability-based theories is more adapted to 



modelize the uncertain data. Of course both possibility and 
probability-based theories can imitate imprecise and uncertain 
data at the same time, but not so easily. That is why our 
choice is conducted on the belief function theory, also called 
the Dempster-Shafer theory [8], [9] or the Transferable Belief 
Model [10], [11]. We can divide the fusion approach into 
two levels: the credal level and the decision level. The credal 
level can be described into three stages: the belief function 
model, the estimation of some parameters depending on the 
model (not always necessary), and the combination. The most 
difficult step is presumably the first one: the belief function 
model from which the other steps follow. 

The paper is organized as follow: in a first section we recall 
the bases of the transferable belief model. Next, we present an 
approach of experts fusion in order to obtain a reality on our 
sonar images. We propose a new multialyer perceptron based 
on belief learning. In the last section, we show the result of 
the classification of sonar images. 

II. Transferable Belief Model bases 
A. Credal level 

1) Belief Function Models: Consider the space of discern- 
ment 9 = {Ci, Ca, . . . , Cat}, where Q is the hypothesis "the 
considered tile belongs to the class i". The belief functions can 
be expressed in several forms: the basic belief assignments 
(bba) m, the credibility function bel and the plausibility 
function pi, which are in one-to-one correspondence. 

The basic belief assignments (bba) m are defined by the 
mapping of the power set 2 e (defined by all the disjunctions 
of 6) onto [0, 1], with: 
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m{X) = 1. 
In the open world case [10]: 



(1) 



m(0) > 0. (2) 

These simple conditions in equation (ff) and fl}, give a large 
panel of definitions of the bba, which is one of the difficulties 
of the theory. The belief functions must therefore be chosen 
according to the intended application. 

The credibility function is given for all X e2 e by: 

bel(X)= Y m ( y )- < 3 > 

The plausibility function is given for all Ie2 e by: 

plpf) = Y = bel ( Q ) ~ bel(X c ), (4) 

Y£2 B ,YnX^i(/> 

where X c is the complementary of X. 

2) Combination rules: Many combination rules have been 
proposed these last years in the context of the belief function 
theory ([12], [13], [10], [14], [15], etc.). In the context of the 
TBM, the combination rule most used today seems to be the 
conjunctive rule given by [10] for all X G 2 e by: 

M 

m c (X)= Y II m ^)> < 5 > 
Y 1 n...r\Y M =x j=i 



where Yj € 2 is the response of the expert j, and rrij(Yj) 
the associated belief function. 

However, the conflict (that is given by m c (0)) can be 
redistributed on partial ignorance like in the Dubois and Prade 
rule [13], a mixed conjunctive and disjunctive rule given for 
all X G 2 e , X ^ by: 

M 

m DP (X)= Y H m i( Y j) 
Y 1 r\...nY M =x j=i 

+ Y fi^iYj), (6) 

YiU...UY M =* 3 = 1 

Yin...nY M =0 

where Yj G 2 is the response of the expert j, and m,j(Yj) 
the associated belief function. 

We have proposed another proportional conflict redistribu- 
tion rule [15] for M experts, for X G 2 e , X ^ 0: 



mpcR(X) = m c (X)+ 
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=3, if 3 < h 
= 3 + 1, if 3 > h 



(7) 



(8) 
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rrii{X) + m <Ti(j)(yvi(j)) 0' m c is the conjunctive 

3 = 1 

consensus rule given by the equation (|5]). This rule allows a 
proportional conflict redistribution on the subsets from where 
the conflict comes and is equivalent for two experts to the rule 
given in [16]. This rule will be illustrated on simple examples 
in the next section. 

These rules are compared in [17]. 

B. Decision level 

The decision is a difficult task. No measures are able to 
provide the best decision in all the cases. Generally, we con- 
sider the maximum of one of the three functions: credibility, 
plausibility, and pignistic probability. 

The pignistic probability, introduced by [18], is here given 
for all X G 2 e , with X ^ by: 

If the credibility function provides a pessimist decision, 
the plausibility function is often too optimist. The pignistic 
probability is usually taken as a compromise. 



III. Experts fusion 

In order to fuse the opinions of different experts on a given 
tile X, we have to take into account the certainty of experts 
and proportion of the two (or more) sediments but not only 
on one focal element. In this case, the space of discernment 
O represents the different kind of sediments on sonar images, 
such as rock, sand, silt, cobble, ripple or shadow (that means 
no sediment information). The experts give their perception 
and belief according to their certainty. For instance, the expert 
can be moderately sure of his choice when he labels one 
part of the image as belonging to a certain class, and be 
totally doubtful on another part of the image. Moreover, on 
a considered tile, more than one sediment can be present. 

Consequently we have to take into account all these aspects 
of the applications. In order to simplify, we consider only two 
classes in the following: the rock referred as A, and the sand, 
referred as B. The proposed models can be easily extended, 
but their study is easier to understand with only two classes. 

Hence, on certain tiles, A and B can be present for one or 
more experts. The belief functions have to take into account 
the certainty given by the experts (referred respectively as ca 
and cb, two numbers in [0, 1]) as well as the proportion of 
the kind of sediment in the tile X (referred as pa and pb, 
also two numbers in [0, 1]). We have two interpretations of 
"the expert believes A": it can mean that the expert thinks 
that there is A on X and not B, or it can mean that the expert 
thinks that there is A on X and it can also have B but he does 
not say anything about it. The first interpretation yields that 
hypotheses A and B are exclusive and with the second they 
are not exclusive. We only study the first case: A and B are 
exclusive. But on the tile X, the expert can also provide A 
and B, in this case the two propositions "the expert believes 
A" and "the expert believes A and B" are not exclusive. 

We propose a model considering only one belief function 
according to the proportion by: 

m(A) = pa-ca, 

m{B)=p B .c Bl (10) 
m(A U B) = 1 - {pa-ca +Pb-c b ). 

For instance, consider two experts providing their opinion 
on the tile X. The first expert says that on tile X there is some 
rock A with a certainty equal to 0.6. Hence for this first expert 
we have : pa — 1, p B = 0, and ca — 0.6. The second expert 
thinks that there are 50% of rock and 50% of sand on the 
considered tile X with a respective certainty of 0.6 and 0.4. 
Hence for the second expert we have: pa = 0.5, ps = 0.5, 
ca = 0.6 and cb = 0.4. We illustrate all our proposed models 
with this numerical exemple. 

Consequently, we have simply: 
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The non-normalized conjunctive rule, the credibility, the 



plausibility and the pignistic probability are given by: 
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In this case we do not have the possibility to decide on APiB, 
because the conflict is on 0. 
The PCR rule provides: 
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where 

mpcR(A) = 0.60 + 0.09 = 0.69, 
mp CR {B) = 0.08 + 0.03 = 0.11. 

With the PCR rule, the decision will be also A. 

Of course, we cannot say on this example which rule is the 
best, and we can apply these two rules in order to construct a 
reality taking into account the doubts of different experts. This 
reality can serve to train a classifier and also to evaluate this 
classifier. We can use many supervised classifiers. In the next 
section, we propose to introduced a new classifier: a multilayer 
perceptron based on belief learning, take into account all the 
reachness of the belief basic assignment. 

IV. Multilayer Perceptron based on Belief 
Learning 

We propose in this section a new belief multilayer percep- 
tron where the difference between the multilayer perceptron 
relates to the learning based on a belief learning. In [19], a 
neural network classifier based on Dempster-Shafer theory is 
presented. In this work, the neural network consider the bba 
at each neuron, that is not the case in our approach presented 
feedforward. 

A. A multilayer perceptron 

The neural network classifiers are today the most used 
supervised classifiers. The multilayer perceptron (MLP) is a 
feedforward fully connected neural network. 

The tile X is described by n features (x\, x n ). Each unit 
of the network is an artificial neuron called perceptron, with 
the structure given in figure [2] 

All the neuron outputs of every layer are connected to all 
the neuron inputs of the next layer weighted by values we 
have to learn. These weights are first initialized with small 
random values. In order to learn these values we present to 
the network the learning vectors and the corresponding desired 
outputs. The objective of the learning process is to minimize 
the quadratic error: 

1 N 

e= 2^(di-Si) 2 , (ID 




Fig. 2. Artificial neuron structure. 

where S{ are the obtained outputs of the multilayer perceptron 
and dj is 1 if the class of X is Cj and elsewhere. As shown 
on figure [2] we can use the sigmoid function given by: 

1 



/(*) = 



1 



(12) 



So we obtain the learning algorithm called the back propaga- 
tion algorithm for the iteration t+1: 



w hh (* + !) = w hh (*) + ^2 (*) s *i (*)> 



(13) 



where is the weight value between the neuron l\ of the 
first layer and the neuron I2 of the following layer, r] stands for 
the learning rate, sj^i) is the obtained output of the neuron 
li at the iteration t, and Si 2 (t) is given by: 



Si(t) = c Si (t)(l - Si(t))(di - Si (t)), 



(14) 



if I2 = i is on the output layer, where the constant c controls 
the slope of the sigmoid function, and 

S h (t)=cs h {t)(l-8 h (t))^2Si(t)w Ua (t), (15) 

1 

elsewhere. 

B. Belief learning 

The use of uncertain and imprecise data for learning have 
been used in [20], [21] for decision trees and in [22], [23] for 
a credal EM approach. In the previous approach, the learning 
set C is composed of K examples (X t ,Ct), t — 1,...,K, 
where X t is a tile (a n-dimensional vector given by n features 
calculated on the tile) and Ct £ 6 the class of X t . The learning 
set is also given by the couples (X t , d t ), with d t the function 
equal to 1 if the class of X is Ct and elsewhere. The belief 
learning is based on the use of a learning set C given by: 



C 



t = 1, 



,K}, 



(16) 



where mf is the bba defined on O. 

In our case, human expert cannot provide with certainty the 
class of a given tile X, and according to the experts, more 
than one class can be present on the tile X. Hence we cannot 
have the function di that is 1 if the class of X is C\ and 
elsewhere. 

The simple idea of the belief learning for the multilayer 
perceptron is to consider the belief basic assignment in order 
to minimize the error e given by the equation ( fTTT l. Hence, we 
obtain 2' I neurons on the output level and we can stay in the 
credal level. 



C. Decision level 

Usually the decision is taken considering the maximum of 
the values on the output layer. These values are between 
and 1, but the sum is not 1. We can easily normalize them in 
order to interpret these values as belief basic assignment. For 
instance the normalization can be made dividing by the sum 
of the values of the output layer. Hence, the decision can be 
conducted by the maximum of the pignistic probability, or with 
other function such as the credibility or the plausibility. Note 
that if the output layer is composed only with the singletons, 
to consider the maximum of the values or the maximum of 
the pignistic probability is the same. 



V. Illustration 



A. Database 



Our database contains 42 sonar images provided by the 
GESMA (Groupe d'Etudes Sous-Marines de l'Atlantique). 
These images were obtained with a Klein 5400 lateral sonar 
with a resolution of 20 to 30 cm in azimuth and 3 cm in range. 
The sea-bottom depth was between 15 m and 40 m. 

Three experts have manually segmented these images giving 
the kind of sediment (rock, cobble, sand, silt, ripple (hori- 
zontal, vertical or at 45 degrees)), shadow or other (typically 
ships) parts on images, helped by the manual segmentation 
interface presented in figure [3] All sediments are given with 
a certainty level (sure, moderately sure or not sure), and the 
boundary between two sediments is also given with a certainty 
(sure, moderately sure or not sure). Hence, every pixel of every 
image is labeled as being either a certain type of sediment 
or a shadow or other, or a boundary with one of the three 
certainty levels. We choose the weights: 2/3, 1/2 and 1/3, for 
respectively the certainty levels: sure, moderately sure and not 
sure. 



Fig. 3. Manual Segmentation Interface. 

B. Experts Fusion 

In order to obtain a kind of reality for learning task, we first 
fuse the opinion of the three experts following the presented 
model. We note A for rock, B for sand, C for cobble, D for 
silt, E for ripple, F for shadow and G for other, hence we 



have seven classes and 9 = {A, B, C, D, E, F, G}. We have 
applied our model on tiles of size 64x64 pixels given by: 

m(A) =PAl-C\ +PA2-C2 +PA3-C3 
m(B) =p B \-C\ +PB2-C2 +PB3-C3 
m(C) = pci-Ci +PC2-C2 +PC3-C3 
m(D) = PDl-Cl +PD2-C2 +PD3-C3 
< m{E) =pE\-C\ +PE2-C2 +PE3-C3 (17) 
m(F) = p F \-C\ +PF2-C2 +PF3-C3 
m(G) =PGl-Cl +PG2-C2 +PG3-C3 

m(6) = 1 - (m(A) + m(B) + m(C) 

+ m(D) +m(E) + m(F) +m(G)), 

where c\, C2 and C3 are the weights associated to the certitude 
respectively: "sure", "moderately sure" and "not sure" (e.g. 
here: c\ = 2/3, C2 = 1/2 and C3 = 1/3). Indeed we have to 
consider the cases when the same kind of sediment (but with 
different certainties) is present on the same tile. The proportion 
of each sediment in the tile associated to these weights is 
noted, for instance for A: Pai, PA2 and ^3. 

In order to provide a reality for the learning, the experts 
can be fuse by the non-normalized conjunctive rule or the 
generalized PCR as we see before, and the decision can be 
taken on the maximum of the pignistic probability. The total 
conflict between the three experts is 0.2432. This conflict 
comes essentially from the difference of opinion of the experts 
and not from the tiles with more than one sediment. Indeed, 
we have a weak auto-conflict (conflict coming from the 
combination of the same expert three times). The values of 
the auto-conflict for the three experts are: 0.0841, 0.0840, and 
0.0746. We note a difference of decision between the three 
combination rules giving by the equations (O for the general- 
ized PCR, and (0 for the conjunctive rule. The proportion of 
tiles with a different decision is 1.01% between the generalized 
PCR and the conjunctive rule. However, we cannot evaluate 
on this database which combination rule is the best. 

C. Results 

In order to classify the tiles of size 64 x 64 pixels, we 
first have to extract texture parameters from each tile. Here, 
we choose the co-occurrence matrices approach [1]. The co- 
occurrence matrices are calculated by numbering the occur- 
rences of identical gray level of two pixels. Four directions 
are considered: 0, 45, 90 and 135 degrees. Concerning these 
four directions, six parameters given by [24] are calculated: 
homogeneity, contrast estimation, entropy estimation, the cor- 
relation, the directivity, and the uniformity. This classical 
approach yields 24 parameters. The problem for co-occurrence 
matrices is the non-invariance in translation. Typically, this 
problem can appear in a ripple texture characterization. More 
features extraction approaches can be used such as the run- 
lengths matrix, the wavelet transform and the Gabor filters 
[1]. 

Hence, each tile is represented by the 24 parameters, and we 
can try to classify the tiles by the multilayer perceptron and the 
belief multilayer perceptron. So, the input layer contains 24 
neurons, and the output layer contains 7 neurons (one for each 



class). For the belief multilayer perceptron, the mass calculated 
by the fusion of the three experts according to the model given 
in (ITTb allows the learning. We test the both combination given 
by the conjunctive non-normalized rule (0 and the PCR rule 
0. The mass model gives focal element only on the singleton 
and the ignorance 6. In order to learn only on the singletons, 
we consider only the bba given on the singletons, and we 
renormalize them in order to obtain one for the singleton given 
the maximum belief; the output values are not bba in all the 
case. 

Hence, the output layer of the belief multilayer perceptron 
is composed only by seven neurons (one for each class). Of 
course, it could be more interesting to keep 2 e = 128 neurons 
on the last layer in order to stay in the credal level and keep 
the power of this classifier. However, this is possible only if 
enough data are available for the learning. 

In order to take a decision on bba with the maximum of 
the pignistic probability, we annul the minimum value of the 
output layer then we normalize by the sum of the values. Here 
it is similar to decide on the maximum of the values of the 
output layer, but it is not the same in all the cases as shown 
afterwards. 

On the 42 sonar images, we have 9266 tiles of size 64x64 
pixels. Our database has been randomly divided into two 
parts. The first one (2/3 of the database) is used for the 
multilayer perceptron and the belief multilayer perceptron 
learnings, and the second one for tests. We repeat this random 
division 30 times in order to achieve a good estimation of 
the classification rate, and we analyze the mean percentage 
of good classification rates defined as the number of good 
classified small-images dived by the total of small-images. 

With the non-normalized conjunctive rule, we obtain 
64.49% of good-classification rates (with a confidence in- 
terval of [64.07;64.91]) for the classic multilayer perceptron 
and 65.10% of good-classification rates (with a confidence 
interval of [64.72;65.48]) for the belief multilayer perceptron. 
If the reality is obtained by the generalized PCR, we have 
64.96% of good-classification rates (with a confidence interval 
of [64.44;65.25]) for the classic multilayer perceptron and 
64.84% of good-classification rates (with a confidence interval 
of [64.55;65.39]) for the belief multilayer perceptron. 

The evaluation is made on an unknown reality and so we can 
not say that the experts fusion given by the non-normalized 
conjunctive rule is better than the experts fusion obtained by 
the generalized PCR rule. In the case of the non-normalized 
conjunctive rule, the belief multilayer perceptron gives signif- 
icantly better good-classification rates than the multilayer per- 
ceptron. In the case of the generalized PCR rule, the results are 
not significantly different. However, if we repeat the random 
division 1000 times, we obtain 65.043% of good-classification 
rates (with a confidence interval of [64.97;65.11]) for the clas- 
sic multilayer perceptron and 65.125% of good-classification 
rates (with a confidence interval of [65.06;65.19]) for the 
belief multilayer perceptron, with the reality is obtained by the 
generalized PCR. These results show that, here also, the belief 
multilayer perceptron improves significantly the classification 



rates. 

Another interest of the belief multilayer perceptron comes 
from the decision step. For instance, the cobble can be seen 
like a doubt between the rock and the sand according to 
the size of the tile. Hence, the class C can be rewriten as 
A U B. The learning will be the same that previously, but 
the decision by the maximum of the probability pignistic will 
provide another result. We cannot compare these results with 
the classic multilayer perceptron, because the decision step 
is taken by the maximum of the values of the outputs layer. 
Another example can be done if we consider the class shadow 
like the absence of information, e.g. we can associate this class 
to the ignorance and rewrite the model by: 

m(A) = PAl-Cl +PA2-C2 +PA3-C3 
m(B) = PBl-Ci + PB2-C2 + PB3-C3 
: m(C) =PC\-C\ +PC2-C2 +PC3-C3 

m{D) =VD\-C\ +PD2-C2 +PD3-C3 n Q , 

m{E) = p E \-C\ + PE2-C2 + PE3-C3 
m(G) =PG1-C\ +PG2-C2 +PG3-C3 

m(6) = 1 - (m(A) + m(B) + m{C) 
+ m(D)+m(E)+m(G)). 

The ignorance can be learned and so be represented by 
a neuron on the output layer. Here also, the decision by the 
maximum of the probability pignistic provides another result, 
we cannot compare with the classic multilayer perceptron. 

VI. Conclusions 

We have proposed in this paper two different fusion ap- 
proaches in sonar images processing. The first novelty is the 
experts fusion model, we can apply in many image processing 
problems. Indeed, if some images represent uncertain envi- 
ronments, the reality is unknown and we must compose and 
propose a reality (e.g. in order to train a classifier) from the 
experts opinions. In this kind of environments, experts cannot 
say with certainty what is exactly on the images and we 
have to take into account the doubt of the experts in order 
to describe the images. The second novelty is the multilayer 
perceptron with a belief learning improves significantly the 
classic multilayer perceptron. It could be more interesting to 
keep 2' neurons on the last layer in order to stay in the credal 
level and keep the power of this classifier. Hence, this classifier 
can provide a belief on every subset of 2 e , and the decision 
can be made on this space. The evaluation of this classifier 
must be made on more data sets especially on databases where 
the real classes are known and with data giving in terms of 
belief. The problem of image classification evaluation is very 
hard to solve in uncertain environment [25]. 
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